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TITLE OF THE INVENTION 

SYNTHETIC HEPARANASE MOLECULES AND USES THEREOF 

FIELD OF THE INVENTION 
5 The present invention relates to synthetically produced, enzymatically active 

heparanase molecules that are capable of expression in high yield heterologous expression 
systems. Also provided herein are methods of expressing mammalian heparanase in 
heterologous expression systems. 

10 BACKGROUND OF THE INVENTION 

Heparan sulfate proteoglycans (HSPGs) are ubiquitous macromolecules found in 
the extracellular matrix (ECM) and on the cell surface that contribute to the maintenance of cell- 
cell and cell-ECM interactions. HSPGs are composed of several heparan sulfate (HS) chains 
covalently linked to a protein core. Heparan sulfate facilitates binding of structural ECM 

15 proteins such as fibronectin, laminin, and collagen, to the cell surface and to other ECM proteins, 
suggesting a role for this glycosaminoglycan in self-assembly and insolubility of ECM 
components, as well as in cell adhesion and locomotion. Biologically functional molecules (e.g. 
growth factors, enzymes, and chemokines) also bind to the cell surface and the ECM via the HS 
chains. Because of the importance of maintaining proper cell-cell and cell-ECM interactions, 

20 HSPGs play crucial structural or regulatory roles in the extracellular milieu, modulating 

important normal and pathological processes ranging from embryogenesis, morphogenesis and 
development to inflammation angiogenesis and cancer metastasis. 

Beside these structural and cell-matrix anchoring roles, the structural diversity of 
HS (Esko et al.J. Clin. Invest 108:169-173 (2001); Turnbull et al. Trends Cell Biol 11: 75-82 

25 (2001)) allows HSPGs to interact with a variety of extracellular signaling proteins such as 

growth factors, enzymes, and chemokines. Among these, some growth factors play important 
roles in tumor growth, invasiveness and angiogenesis, such as fibroblast growth factors (FGF1 
and FGF2), vascular endothelial growth fector (VEGF), hepatocyte growth factor, transforming 
growth factor p and platelet-derived growth factor. In several cases, HSPGs act not only as a 

30 depot for these signaling molecules, activating or stabilizing them, but also participate in ligand- 
receptor interactions, such as the binding of FGF2 to the diverse isoforms of the FGF receptor 
(Chang et al. FASEB J. 14: 137-144 (2000)). ' 

Heparanase is an endo ^-^-glucuronidase released by platelets, placental 
trophoblasts, and leukocytes, that specifically degrades heparan sulfate by cleaving the 

-1- 



Hnnv nrnuiHoH hu I LQPTO frnm tho PAHP Imano riataKaco nn tV7H\SUOt\t\A 



* ITR006OPV 



glycosidic bond through a hydrolase mechanism. Degradation of HSPG by heparanase results in 
the release of growth factors such as bFGF, uPA, and tPA that may either initiate neo- 
angiogenesis or potentiate ECM degradation. Additionally, HS cleavage by heparanase allows 
cells to migrate through the basal membranes (BM) and traverse the ECM barriers. HS 
5 degradation plays an important role in numerous physiological processes by allowing cells to 
quickly respond to extracellular changes. Therefore, inhibition of heparanase activity could 
affect pathologies correlated with altered cell migration such as inflammation, metastasis, and 
autoimmune disorders. 

Due to this pivotal role, heparanase is a potential novel target for the development 
10 of antitumor, antimetastasis or anti-inflammatory drugs. For purposes of drug development, 
heparanase has a significant advantage over the matrix metalloproteases, which are also ECM- 
modifying enzymes, because it is likely a single gene product and not part of a complex family 
of related proteins. Exploiting heparanase as a drug target is presently hampered by both the 
scarcity of reliable high-throughput assays and by its complex biogenesis, which renders the 
1 5 production of large amounts of active protein a difficult task. 

Human heparanase cDNA encodes a protein that is initially synthesized as a pre- 
pro- protein with a signal peptide sequence removed by signal peptidase upon translocation into 
the ER. The resulting 65 kDa pro-form is further processed by removing the 157 N-terminal 
amino acids to yield the mature 50 kDa heparanase. The 50 kDa protein has a specific activity at 
20 least 100 fold higher than the unprocessed 65 kDa precursor (Vlodavsky et al. Nat. Med. 5: 793- 
802 (1999)). Interestingly, the 50 kDa protein is inactive if expressed as such in mammalian 
cells (Hulett et al. Nat. Med. 5: 803-809(1999)) and the active form of the enzyme was 
proposed to be a heterodimer between the 50 kDa fragment and an 8 kDa fragment arising from 
the excision of an intervening 6 kDa peptide by unidentified proteolytic enzyme(s) (Fairbanks et 
25 al. J. Biol. Chem. 274: 29587-29590 (1999)). Consistent with this hypothesis, McKenzie et al. 
(Biochem J. 373: 423-435 (2003)) produced active heterodimeric heparanase in insect cells and 
confirmed that the 8 kDa subunit is necessary for heparanase activity. 

Endogenous heparanase can be purified from various sources; however, low 
heparanase expression levels lead to the necessity for laborious and expensive purification 
30 procedures. For example, Toyoshima & Nakajima (J. Biol. Chem. 274: 24153-24160 (1999)) 
described a process for purifying endogenous human heparanase from platelets that requires four 
different chromatographic steps and lasts five days. 

Another drawback to the purification of endogenous heparanase is that overall 
yields are characteristically low. For instance, Fairbanks et al. (J. Biol. Chem. 274, 29587- 
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29590, (1999)) report the purification of only 22 of heparanase from platelets, with a yield of 
6%. Similarly, Fuks and colleagues (U.S. Patent No. 5,362,641) describe a 4000-fold 
purification of heparanase from 1.4 kg of protein derived from the human hepatoma cell line Sk- 
Hep-1, producing only 6.5 \ig of purified heparanase protein with a yield of 1.9%. A 240,000- 
5 fold purification of heparanase from the same cell line was disclosed by Pecker et al. (U.S. 
Patent No 5,968,822) which required over 500 liters of cell culture. 

The identification and cloning of the human heparanase gene (Vlodavksy et al, 
Nature Med. 5: 793-802 (1999); Hulett et al, Nature Med. 5: 803-809 (1999); Toyoshima & 
Nakajima, /. Biol Chem. 274: 24153-24160 (1999)) allowed the recombinant expression of 

10 heparanase protein in heterologous expression systems. However, serious deficiencies have 

been noted with such heterologous expression systems in relation to heparanase production. For 
example, Ben-Artzi et al. (WO 99/57244) describe the expression of recombinant human 
heparanase in bacterial, mammalian, yeast and insect cells. Although heparanase expression was 
obtained, there was no detectable enzymatic activity associated with the recombinant protein 

15 when E.coli was host cell, and only the 70 kDa unprocessed precursor was detected when 
heparanase was expressed in the yeast Pichia pastoris. 

Ben-Artzi and colleagues {supra) also describe the expression of recombinant 
heparanase in mammalian cells, namely, human kidney fibroblasts (293), baby hamster kidney 
cells (BHL21) and Chinese hamster ovary cells (CHO). However, these expression systems are 

20 known to have low yields and high associated costs. Furthermore, despite the fact that 
processing of the recombinant full-length precursor to yield the active, mature protein is 
observed in these cells, no homogeneously processed protein is obtained because the processing 
reaction is inefficient Additionally, the use of expression vectors driving the secretion of 
heparanase does not lead to production of recombinant heparanase in the conditioned medium of 

25 CHO cells, which have to be further stimulated to secrete heparanase by addition of calcium 
ionophore or PMA. Only a minor fraction of the secreted protein appears to be correctly 
processed in this system. 

The production of heparanase in insect cell expression systems such as Sf21 or 
High five cells are described in the art (WO 99/57244, WO 99/U798, US Patent No. 5,968,822; 

30 US Patent No* 6,348,344; and US Patent No. 6,190,875). However, although efficient secretion 
into the growth medium was observed with such prior methods, specific activity of the enzyme 
was very low and no correct processing was observed. For example, WO 99/57244 describes the 
introduction of protease cleavage sites downstream of positions 1 19 or 157 of the heparanase 
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protein in order to generate a correctly processed heparanase in insect cell expression systems. 
However, these constructs were not shown to be enzymatically active. 

McKenzie et al. (supra) described the production of active heterodimeric 
heparanase in insect cells. This system, however, has the disadvantage of requiring the 

5 simultaneous production of two different recombinant proteins (the 8 kDa and the 50 kDa 
subunits). Since admixture of the isolated 8 kDa and 50 kDa domains does not result in 
heparanase activation, the successful recovery of an active heterodimer by simultaneous 
expression probably relies on a co-translational formation of the heterodimeric complex. 
Treatment of this complex with glycanase lead to its dissociation and to the precipitation of the 

10 50 kDa subunit, suggesting a poor stability and solubility. 

Despite the methods described above to obtain heparanase in active or inactive 
form, it would be advantageous to produce biologically active heparanase molecules that are 
capable of expression in high yield, low cost heterologous expression systems. Said molecules 
can be used in inhibitor screening assays for the development of therapeutics or pharmaceuticals 

1 5 to inhibit and/or treat metastatic growth and/or inflammation. 

SUMMARY OF THE INVENTION 

The present invention provides synthetic nucleic acid molecules that encode 
biologically active mammalian heparanase, wherein the nucleic acid molecules are capable of 

20 expression in high yield heterologous expression systems. The synthetic heparanase molecules 
provided herein present a significant advance over wild-type heparanase, which is expressed at 
low levels in mammalian systems and improperly processed in heterologous expression systems. 
The synthetic molecules of the present invention can be used in inhibitor screening assays for the 
development of therapeutics or pharmaceuticals to inhibit and/or treat metastatic growth, 

25 autoimmune disorders, and/or inflammation. 

Also provided herein are synthetically produced, biologically active, mammalian 
heparanase polypeptides and heparanase polypeptides comprising endoproteinase consensus 
cleavage sites that are capable of biological activity upon incubation with the appropriate 
enzyme. 

30 The present invention further provides methods for expressing mammalian 

heparanase in heterologous expression systems, said methods resulting in high levels of 
biologically active heparanase expression. 



-4- 



ITR0060PV 



As used throughout the specification and in the appended claims, the singular 
forms "a," "an " and "the" include the plural reference unless the context clearly dictates 
otherwise. 

As used throughout the specification and appended claims, the following 
5 definitions and abbreviations apply: 

A "conservative amino acid substitution" refers to the replacement of one amino 
acid residue by another, chemically similar, amino acid residue. Examples of such conservative 
substitutions are: substitution of one hydrophobic residue (isoleucine, leucine, valine, or 
methionine) for another; substitution of one polar residue for another polar residue of the same 
10 charge (e.g., arginine for lysine; glutamic acid for aspartic acid). 

The term "mammalian" refers to any mammal, including a human being. 
The term "treatment" refers to both therapeutic treatment and prophylactic or 
preventative measures. Those in need of treatment include those already with the disorder as 
well as those prone to have the disorder or those in which the disorder is to be prevented. A 
15 "disorder" is any condition that would benefit from treatment with molecules identified using the 
nucleic acid molecules and polypeptides described herein. Such disorders include, but are not 
limited to, cancer, inflammation and autoimmune disorders. 

The term "vector" refers to some means by which DNA fragments can be 
introduced into a host organism or host tissue. There are various types of vectors including 
20 plasmid, virus (including adenovirus), bacteriophages and cosmids. 

"Biologically active" refers to a protein having structural, regulatory, or 
biochemical functions attending a naturally occurring molecule or isoform thereof. In the 
context of heparanase, "biologically active" proteins comprise heparanase enzymatic activity. 
"Substantially similar" means that a given sequence shares at least 80%, 
25 preferably 90%, more preferably 95%, and even more preferably 99% homology with a 
reference sequence. In the present invention, the reference sequence can be the full-length 
human heparanase nucleotide or amino acid sequence, or the nucleotide or amino acid sequence 
of the 8 kDa or 50 fcDa heparanase fragments, as dictated by the context of the text Thus, a 
heparanase protein sequence that is "substantially similar" to the 8 kDa human heparanase 
30 fragment will share at least 80% homology with the 8 kDa human heparanase fragment, 
preferably 90% homology, more preferably 95% homology and even more preferably 99% 
homology. Whether a given heparanase protein or nucleotide sequence is "substantially similar" 
to a reference sequence can be determined for example, by comparing sequence information 
using sequence analysis software such as the GAP computer program, version 6.0, available 
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from the University of Wisconsin Genetics Computer Group (UWGCG). The GAP program 
utilizes the alignment method of Needleman and Wunsch (J. Mol. Biol. 48:443, 1970), as revised 
by Smith and Waterman (Adv. Appl. Math. 2:482, 1981). 

A "gene" refers to a nucleic acid molecule whose nucleotide sequence codes for a 
5 polypeptide molecule. Genes may be uninterrupted sequences of nucleotides or they may include 
such intervening segments as introns, promoter regions, splicing sites and repetitive sequences. 
A gene can be either KNA or DNA. A preferred gene is one that encodes the invention peptide. 

The term "nucleic acid" or "nucleic acid molecule" is intended for ribonucleic 
acid (RNA) or deoxyribonucleic acid (DNA), probes, oligonucleotides, fragment or portions 
10 thereof, and primers. DNA can be either complementary DNA (cDNA) or genomic DNA, e.g. a 
gene encoding the invention peptide. 

"Wild-type heparanase" or "wild-type protein" or "wt protein" refers to a protein 
comprising a naturally occurring sequence of amino acids or variant thereof. The amino acid 
sequence of wild-type human heparanase is available in the art (Vlodavksy et al, Nature Med. 5: 
1 5 793-802 (1999); Hulett et al, Nature Med. 5: 803-809 (1999); Toyoshima & Nakajima, J. Biol. 
Chem. 274: 24153-24160 (1999); which are herein incorporated by reference in their entirety). 

"Wild-type heparanase gene" refers to a gene comprising a sequence of 
nucleotides that encodes a naturally occurring heparanase protein, including proteins of human 
origin or proteins obtained from another organism, including, but not limited to, insects such as 
20 Drosophila, amphibians such as Xenopus, and mammals such as rat, mouse and rhesus monkey. 
The nucleotide sequence of the human heparanase gene is available in the art (Genbank 
AccessionNo. AF155510; Toyoshima and Nakajima, J. Biol. Chem. 274(34): 24153-24160 
(1999), which are hereby incorporated by reference in their entirety). 

"Substantially free from other proteins" or "substantially purified" means at least 
25 90%, preferably 95%, more preferably 99%, and even more preferably 99.9%, free of other 
proteins. Thus, a heparanase protein preparation that is substantially free from other proteins 
will contain, as a percent of its total protein, no more than 10%, preferably no more than 5%, 
more preferably no more than 1%, and even more preferably no more than 0.1%, of non- 
heparanase proteins. Whether a given heparanase protein preparation is substantially free from 
30 other proteins can be determined by such conventional techniques of assessing protein purity as, 
e.g., sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) combined with 
appropriate detection methods, e.g., silver staining or immunoblotting. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 depicts the biosynthesis of human heparanase in mammalian cells. 
FIGURE 2, Panel A, shows a schematic view of the heparanase constructs with 
engineered TEV cleavage sites. Panel B, left: Western blot analysis of wild-type heparanase 
5 expressed in Cos 7 cells (1 lane), hepTEVl 10 0ane 2), hepTEVl 10 after 16 hours incubation 
with (lane 3) or without (lane 4) 0.5 uM TEV protease, hepTEVl 10/158 (lane 5), 
hepTEVl 10/158 after 16 hours incubation with (lane 6) or without (lane 7) 0.5 uM TEV 
protease. Right. Heparanase activity of the same samples using the fluorimetric method. 

FIGURE 3 : Panel A: Multiple sequence alignment of heparanase against related 
10 sequences. Predicted secondary structure elements are shown above the alignment (arrows = 
beta strands, cylinders = helix). The position of the two cleavage sites is indicated by black 
triangles. The region of the excised heparanase segment substituted by the Hyaluronidase 
fragment is surrounded by a grey box. Panel B: Schematic view of the TIM barrel architecture. 
The location of the excised heparanase segment is indicated with the cleavage points shown as 
15 triangles. If present, the segment most likely obscures binding of the substrate (grey arrow) by 
beta/alpha units 1 and 2. Design of a shorter loop (dotted line) removes this coiistraint, leading 
to an active enzyme while, at the same time, maintaining the structural integrity of the enzyme. 

FIGURE 4, Panel A, shows a schematic view of the single chain heparanase 
constructs. Panel B, left, shows western blot analysis of wild-type heparanase or single chain 
20 constructs expressed in Cos 7 cells. Panel B, right shows heparanase activity of the same 
samples using the radiometric assay. 

FIGURE 5: Left, Western blot analysis of wild-type heparanase or single chain 
constructs expressed in Sf9 cells. Right: Heparanase activity of the same samples using the 
radiometric assay. 



25 



DETAILED DESCRIPTION OF THE INVENTION 

Heparanase is a mammalian enzyme that degrades heparan sulfate (HS) by 
cleaving the glycosidic bond through a hydrolase mechanism. HS degradation plays an 
important role in numerous physiological processes by allowing cells to quickly respond to 
30 extracellular changes by altering cell-cell and cell-ECM interactions. Because of the importance 
of these interactions, inhibition of heparanase activity could affect several pathologies such as 
tumor cell metastasis, T-cell mediated delayed type hypersensitivity and autoimmunity. 

Several lines of evidence suggest that heparanase is involved in tumor cell 
metastasis. First, expression levels of heparanase correlate with the metastatic potential of 
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several tumors and tumor cell lines. Second, patients with aggressive metastatic disease have 
measurable heparanase activity in their urine. This observation is not seen with all cancer r 
patients. Additionally, the inhibition of heparanase activity by non-anticoagulant heparin 
derivatives reduced the incidence of metastases by B16 melanoma, Lewis lung carcinoma and 
5 mammary adenocarcinoma cells. Finally, transfection of nonmetastatic murine cells with the 
human heparanase gene resulted in increased mortality and metastasis in two mouse models. 

The human heparanase gene does not share substantial homology with any other 
known proteins. At the time of its discovery, evidence suggested that heparanase is not a 
member of a gene family, but rather a single gene or at least the dominant endoglucuronidase 
10 involved in HSPG degradation. A second heparanase (hpa2), which shares 34% homology at the 
amino acid level, was later identified; however, hpa2 seems to serve a different function based 
on its tissue distribution. The absence of closely related proteins that accomplish analogous 
tasks, coupled with the above evidence demonstrating a role for heparanase in metastatic growth, 
make heparanase an excellent target for the development of therapeutics in these areas. 
15 FIGURE 1 depicts the biosynthesis of human heparanase. Briefly, the heparanase 

cDNA encodes a protein that is initially synthesized as a pre-pro- protein with a signal peptide 
sequence (residues Met'-Ala 35 ) removed by signal peptidase upon translocation into the ER. The 
resulting 65 kDa pro-form is further processed by removing the 157 N-terminal amino acids to 
yield the mature 50 kDa heparanase. The 50 kDa protein has a specific activity at least 100 fold 
20 higher than the unprocessed 65 kDa precursor (Vlodavsky et al. Nat. Med. 5: 793-802 (1999)). 
The active form of the enzyme was proposed to be a heterodimer between the 50 kDa fragment 
and an 8 kDa fragment arising from the excision of an intervening 6 kDa peptide (residues 
Glu ,09 -Gln 157 ) by unidentified proteolytic enzyme(s) (hereinafter "intervening fragment" or "6 
kDa fragment" (Fairbanks et al. J. Biol. Chem. 274: 29587-29590 (1999). 
25 Despite recent evidence showing that the 8 kDa subunit is necessary for 

heparanase activity (McKenzie et al. Biochem J. 373: 423-435 (2003)), the role of the 8 kDa 
subunit in the activation process of heparanase remained unclear prior to the studies disclosed 
herein: it could function as an essential subunit or, alternatively, act as a chaperone and be 
dispensable after having accomplished this function. It was also not clear whether other 
30 components besides the 8 kDa subunit are necessary to eUcit heparanase activation. Multiple 

sequence alignments and secondary structure prediction lead to a model of the human heparanase 
according to which the protein adopts a TIM barrel fold, as found in several glycosidases (Hulett 
et al. Biochemistry 39:15659-15667 (2000)). This common fold motif usually consists of 8 
alternating a-helices and p-strands. Within the 50 kDa fragment clear homology is observed 
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only starting with the 3* o/p* unit of the TIM barrel fold, suggesting either that heparanase 
adopts a novel fold consisting of only 6 a/p* units or that other parts of the protein contribute the 
missing units. 

It was postulated that the 8-kDa fragment might contribute the missing structural 
5 elements. Following this hypothesis, a model of the secondary structure of heparanase, based on 
multiple sequence alignments, was built to design single chain heparanase molecules having the 
8 kDa and the 50 kDa subunits covalently linked together, as described herein. The present 
invention shows that connecting the 8 kDA and 50kDa fragments with a linker results in 
constitutively active, single chain heparanase molecules that do not require proteolytic 
10 processing. In exemplary embodiments of the invention, the two fragments were connected by 
grafting of a loop derived from H. maUinensis hyaluronidase or with a linker comprising three 

glycine-serine repeats. 

It is also shown herein that by engineering endoproteinase cleavage sites at about 
the N and C termini of the 6 kDa intervening fragment, proteolytic processing at both sites of a 

15 purified protein leads to heparanase activation in the absence of other components. In an i 
exemplary embodiment of this aspect of the invention, tobacco etch virus protease cleavage sites 
are added at the N and C termini of the 6 kDa intervening fragment, resulting in active 
heparanase. The present invention provides evidence of human heparanase adopting a canonical 
TIM barrel fold and, advantageously, provides methods for facile production of active enzyme 

20 molecules for the identification of specific inhibitors. 

The engineered proteins, nucleic acid molecules and methods of the present 
invention for expressing biologically active heparanase in heterologous expression systems, 
particularly insect cells, characteristically produce yields of 0.5 - 5 mg/1. Furthermore, these 
proteins are efficiently secreted into the growth medium, whereas in mammalian cells the 

25 authentic human enzyme is mainly retained inside cells or associated with the cell membranes 
(Vlodavsky et al, Semin. Cancer Biol 12: 121-129 (2002)). 

Accordingly, the present invention relates to synthetic nucleic acid molecules that 
encode mammalian heparanase, wherein the nucleic acid molecules are capable of expression in 
high yield heterologous expression systems. The synthetic heparanase molecules provided 

30 herein present a significant advance over wild-type heparanase, which are expressed at low 
levels in mammalian systems and improperly processed in heterologous expression systems. 
The synthetic molecules of the present invention can be used in inhibitor screening assays for the 
development of therapeutics or pharmaceuticals to inhibit and/or treat metastatic growth and/or 
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inflammation. Said synthetic molecules are also useful in the development of therapeutics for 
the treatment of autoimmunity and inflammation. 

In one aspect of the present invention, synthetic nucleic acid molecules 
comprising a sequence of nucleotides that encode a mammalian heparanase protein are provided, 

5 the sequence of nucleotides comprising two consensus cleavage sites recognized by an 
endoproteinase, the cleavage sites located between residues 100 and 168 of the heparanase 
protein. This aspect of the present invention provides synthetic nucleic acid molecules that can 
be used in methods for carrying out the proteolytic processing of the heparanase protein, similar 
to the biosynthesis of wild-type heparanase, resulting in a biologically active enzyme. 

10 Also provided herein are substantially pure polypeptides encoded by the nucleic 

acid molecule described above. 

In a preferred embodiment of the invention, the mammalian heparanase protein is 

human heparanase. 

The two consensus cleavage sites can be introduced anywhere between residues 

15 100 and 168 of the human heparanase protein, provided that after incubation with the appropriate 
enzyme, the resulting fragments comprise at least one fragment that is substantially similar to the 
wild-type 8 kDa fragment and at least one fragment that is substantially similar to the wild-type 
50 kDa fragment. In a preferred embodiment of Ihe invention, the consensus cleavage sites are 
located before residues Gl 10 and K158 of the human heparanase protein, resulting in a first 

20 fragment of 8 kDa, a second "intervening fragment" of 6 kDa and a third fragment of 50 kDa 
after incubation with the appropriate enzyme. 

It is understood by one of skill in the art that cleavage sites corresponding to any 
endoproteinase can be engineered into the heparanase molecule to obtain active, heterodimeric 
heparanase, including, but not limited to, cleavage sites from tobacco etch virus, 3C protease 

25 from picornavirus, thrombin, Factor Xa and enterokinase. In a preferred embodiment of the 
invention, the cleavage sites are from tobacco etch virus. 

In another aspect of the present invention, there is provided a constitutively 
active, single-chain mammalian heparanase nucleic acid molecules comprising a portion that 
encodes a mammalian heparanase protein, the protein coding portion consisting essentially of a 

30 sequence of nucleotides encoding an N-terminal fragment of about 8 kDa, a synthetic linker, and 
a sequence of nucleotides encoding a C-terminal fragment of about 50 kDa. This aspect of the 
present invention provides synthetic genes encoding heparanase that are constitutively active 
without proteolytic processing, wherein the synthetic gene is engineered to substantially remove 
the 6 kDa "intervening fragment" and replace said intervening fragment with a smaller linker. 
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In preferred embodiments of this aspect of the present invention, the mammalian 
heparanase protein is a human heparanase. 

Also provided herein is a purified synthetic heparanase protein encoded by the 
constitutively active, single-chain mammalian heparanase gene described above. 

5 Any sequence encoding a peptide comprising from about 1 to about 67 residues 

can be used as a linker in this aspect of the present invention. Said linker can be synthetic or 
isolated from a naturally occurring source. In an exemplary embodiment of the present 
invention, the linker comprises a sequence of nucleotides that encodes a central loop region of 
the hyaluronidase protein. It is preferred that the hyaluronidase is from H. baillensis. In other 

10 embodiments, the linker comprises a sequence of nucleotides that encodes a (GlySer)3 linker. 

The present invention further relates to recombinant vectors that comprise the 
synthetic nucleic acid molecules disclosed throughout this specification. These vectors may be 
comprised of DNA or RNA. For most cloning purposes, DNA vectors are preferred. Typical 
vectors include plasmids, modified viruses, baculovirus, bacteriophage, cosmids, yeast artificial 

1 5 chromosomes, and other forms of episoinal or integrated DNA that can encode a recombinant 
heparanase protein. It is well within the purview of the skilled artisan to determine an 
appropriate vector for a particular gene transfer or other use. 

An expression vector containing the synthetic nucleic acid molecules disclosed 
throughout this specification may be used for high-level expression of mammalian heparanase in 

20 a recombinant host cell. Expression vectors may include, but are not limited to, cloning vectors, 
modified cloning vectors, specifically designed plasmids or viruses. Also, a variety of bacterial 
expression vectors may be used to express recombinant heparanase in bacterial cells if desired. 
In addition, a variety of fungal cell expression vectors may be used to express recombinant 
heparanase in fungal cells. Further, a variety of insect cell expression vectors may be used to 

25 express recombinant protein in insect cells. In a preferred embodiment of the present invention, 
the vector is a baculovirus vector. 

The present invention also relates to host cells transformed or transfected with 
vectors comprising the synthetic nucleic acid molecules of the present invention. Recombinant 
host cells may be prokaryotic or eukaryotic, including but not limited to, bacteria such as E. coli 9 

30 fungal cells such as yeast including, but not limited to, Pichia pastoris, Hansenula polymorpha 
and Saccharomyces cervisiae, and insect cells, including but not limited to, Drosophila and 
silkworm derived cell lines. Such recombinant host cells can be cultured under suitable 
conditions to produce high levels of mammalian heparanase or a biologically equivalent form. 
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As defined herein, the term "host cell" is not intended to include a host cell in the body of a 
transgenic human being, human fetus, or human embryo. 

As stated above, the synthetic molecules of the present invention provide a 
significant advantage over the prior art because they are capable of expression in high-yield 
5 heterologous expression systems. The heparanase proteins encoded by the synthetic molecules 
provided herein are correctly processed, enzymatically active, and expressed to high levels. 
Therefore, in preferred embodiments of the present invention, the host cell chosen is part of a 
high yield heterologous expression system, including, but not limited to, insect cells, bacterial 
celts, and yeast cells. In a particularly preferred embodiment of the present invention, the host 

10 cell is an insect cell. 

The present invention also relates to recombinant vectors and recombinant host 
cells, both prokaryotic and eukaryotic, which contain the nucleic acid molecules disclosed 
throughout this specification. The synthetic nucleic acid molecules, associated vectors, and 
hosts of the present invention are useful in screening assays to identify inhibitors of heparanase 

15 activity, which, are useful for the treatment of cancer, inflammation and/or autoimmunity. 

In another aspect of this invention, there is provided a method of expressing 
mammalian heparanase in non-mammalian cells comprising: (a) transforming or transfecting 
non-mammalian cells with a vector comprising a sequence of nucleotides that encodes a 
mammalian heparanase protein, the sequence of nucleotides comprising two consensus cleavage 

20 sites recognized by an endoproteinase, the cleavage sites located between residues 100 and 168 
of the heparanase protein; (b) culturing the host cell under conditions which allow expression of 
said heparanase protein; and (c) exposing the cells to the endoproteinase, wherein the heparanase 
protein is cleaved at the consensus cleavage sites. 

This invention also provides substantially purified protein produced by the 

25 method described above. 

In a preferred embodiment of this aspect of the invention, the mammalian 
heparanase is human heparanase. In a further preferred embodiment, the consensus cleavage 
sites are located before residues Gl 10 and K158 of human heparanase. 

In another preferred embodiment, the cleavage sites are tobacco etch protein 

30 cleavage sites. 

Also provided herein is a method of expressing a single chain, constitutively 
active mammalian heparanase in non-mammalian cells comprising: (a) transforming or 
transfecting non-mammalian cells with a vector comprising a synthetic mammalian heparanase 
gene, wherein the synthetic gene comprises a portion that encodes the heparanase protein, the 
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protein coding portion consisting essentially of a sequence of nucleotides encoding an N- 
terminal fragment of about i kDa, a sequence of nucleotides encoding a synthetic linker and a 
sequence of nucleotides encoding a C-terminal fragment of about 50 kDa; and (b) cultunng the 
host cell under conditions which allow expression of said heparanase protein. 

5 Also provided herein is a substantially purified protein produced by the method 

described above. In a further embodiment of this invention, the protein is capable of binding an 
antibody that is specific for wild-type heparanase. 

In a preferred embodiment of this aspect of the invention, the synthetic linker 
comprises a central loop region of the hyaluronidase protein. In another preferred embodiment, 

10 the linker comprises a (GlySer)3 peptide- 

All publications mentioned herein are incorporated by reference for the purpose 
of describing and disclosing methodologies and materials that might be used in connection with 
the present invention. Nothing herein is to be construed as an admission that the invention is not 
1 5 entitled to antedate such disclosure by virtue of prior invention. 

Having described preferred embodiments of the invention with reference to the 
accompanying drawings, it is to be understood that the invention is not limited to those precise 
embodiments, and that various changes and modifications may be effected therein by one skilled 
in the art without departing from the scope or spirit of the invention as defined in the appended 
20 claims. 

The following examples illustrate, but do not limit the invention. 

EXAMPLE 1 

Pinning of heparanase from a hu man placenta cDNA library- 

25 Human heparanase (Accession No. AF 1555 10) was amplified from a normal 

human placenta cDNA library (Invitrogen Corp., Carlsbad, CA) by PCR using TaKaRaLa Taq 
polymerase (TaKaRa Bio Inc., Otsu, Shiga, Japan). Buffer conditions were those suggested by 
the supplier. PCR amplification of the cDNA templates consisted of one cycle of 94°C for one 
minute, followed by 35 cycles of 94*C for 30s, 57°C for 30s and 68°C for 110 seconds. The 

30 amplified fragment was gel purified, phosphorylated, and cloned either in the BamHl site of 
pFAST BAC1, after filling in (baculovirus expression) or into a BamYOlEcdBl- digested 
pCDNA3 vector (mammalian cell expression). The following primers were used for PCR 
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amplification and simultaneous optimization of the Kozak sequence: hHEPl-24 BamHI opti 5* - 
CGGGATCCGCCGCACCATGCTGCTGCGCTCGAAGCCTGCG-3' 
(SEQ ID NO: 1); and hHEP rev 1632 5' - TCA GAT GCA AGC AGC AAC TTT GGC - 3' 
(SEQIDNO:2). 

5 

EXAMPLE 2 
Construction of single chain heparanase molecules 

The following constructs (WT, 109, 106, GS3, GS6, GS4 and Hyaluro), 
covalently linking the 8 and 50 kDa subunits in a direct fashion (109 and 106), linking the 
10 subunits via glycine-serine spacers (GS3, GS4 and GS6) or by grafting a loop region from the 
enzyme hyaluronidase (Hyaluro) were generated by standard PCR mutagenesis using the 
indicated primers: 

hHEPl-24 BamHI opti (SEQ ID NO: 1) and hHEP rev 1632 (SEQ ID NO:2) 
15 109 Ml E109-Q157 1543 

Mutagenic primer: hHEP 304/504 5' - CTAATTTTCGATCCCAAGAAGGAAA 
AAAAGTTCAAGAACAGCACCTAC — 3' (SEQ ID NO:3) 

20 106 Ml P106-K158 1543 

Mutagenic primer. hHEP 291/504 bis 5'-AAGACAGACTTCCTAATTTTCGA 
TCCCAAAAAGTTCAAGAACAGCACCTAC-3' (SEQIDNO:4) 

25 GS3 Ml E109-(GS) 3 -Q1S7 1543 

Mutagenic primer: hIffiP_3Q4£GS3)504 5' - C T A A T T T T C G A T C C C A A G A A G G 
AAGGTAGCGGTTCCGGCTCTAAAAAGTTCAAGAAC-3' (SEQ ID 

NO:5) 

30 

GS6 Ml E109-(GS) 6 -Q157 1543 
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Mutagenic primer: hHEP 304CGS6 Ala) 5' - CTAATTTTCGATCCCAAGAAGG 
AAGGTAGCGGCGCTGGATCAGGGGCAGCAGGATCCGGCGCC 
AAAAAGTTCAAGAACAGCACCTAC (SEQ ID NO:6) 

5 GS4 Ml W118-(GS) 4 -E143 1543 

Mutagenic primer: hHEP 329(GS4 Ala) 5' - A C C T T T G A A G A G A G A A G T T A C T 
GGGGTTCAGGGGCAGGATCCGGCGCCGAATGGCCCTACCAG 

GAGCAATTG (SEQ ID NO:7) 

10 

Hyaluro Ml Wl 18-(AFKDKPT) (SEQ ID NO:8VE143 1 543 

Mutagenic primer: hHEP Hvaluro 5'-:ACCTTTGAAGAGAGAAGTTACTGG 
GCCTTCAAGGACAAGACCCCCGAATGGCCCTACCAGGAGCA 
15 ATTG-3'(SEQIDNO:9) 

EXAMPLE 3 

Construction of heparanase molecules with engineered protease cle avage sites 

To construct an engineered heparanase molecule inserting the consensus cleavage 
20 site for the tobacco etch virus (TEV) protease flanked by GS repeats (E 1 09-GSGSENLYFQ- 
GSG-G1 10 (SEQ ID NO: 10), the scissile bond being located between Q and G) between amino 
acids E109 and Gl 10, PCR mutagenesis was employed using wt heparanase as a template and 
the primers hwkpi -?4 RamTTT npti (SEQ ID NO: 1) and hHEP rev 1632 (SEQ ID NO:2) and the 
mutagenic primer TEV110 bis 5' -GGC AG CGGATCTG AG A AC CTGTACTT 
25 CCAGGGTTCCGGTTCAACCTTTGAAGAGAGAAGTTAC — 3' (SEQ 

IDNO:ll). 

To construct an engineered heparanase having TEV- cleavage sites both between 
residues E109/G1 10 and Ql 57/K158 the TEV1 10 construct was used as a template to insert the 
sequence Q 1 57-GSGSENLYFQ-GSGS-K1 58 (SEQ ID NO: 12) by PCR mutagenesis using the 
30 mutagenic primer TEV158 ter 5' -T CT G G AT C C G G TG AAA AT CTCTATTTT 
CAGGGCTCAGGAAGTAAAAAGTTCAAGAACAGCACCTAC-3' 

(SEQ ID NO: 13). 

- 15- 
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All constructs were sequenced on both strands to assure that no mutations were 
introduced by PCR and cloned into pFASTBACl as described above. 

EXAMPLE 4 

5 Transient expression of heparanase molecules in COS 7 cells. 

Cells were grown in Dulbecco's MEM (Gibco BRL, Gaithersburg, MD). All 
constructs were cloned into the eukaryotic expression plasmid pcDNA3 (Invitrogen). A vector 
encoding the reporter gene p-lactamase (BLA) was co-transfected in order to check transfection 
efficiency of each construct. The quantity of each transfected vector was adjusted in order to ( 

10 obtain comparable transfection efficiencies. Transient transfection of COS7 cells was obtained 
using the fuGENE 6 Transfection Reagent (Roche, Basel, Switzerland) according to 
manufacturer's instructions. 24 hours after transfection, efficiency was assessed by fluorimetric 
detection of BLA-positive cells. 96 hours after transfection, cells were harvested and 
resuspended in lysis buffer (50mM Tris-HCl pH 7.5, 150mM NaCl, 0.5% Triton) containing 

15 Complete protease inhibitor cocktail (Roche). The lysis was carried out on ice for 30 minutes. 
After centrifugation at 14000 rpm for 30 minutes, the heparanase containing superaatants were 
recovered and partially purified as outlined below. 

Heparanase constructs were expressed in COS-7 cells, which are devoid of 
endogenous heparanse activity, by transient transfection. Heparanase was extracted from cell 

20 lysates by heparin affinity chromatography and quantified on Western blots. In parallel, 

heparanase enzymatic activity was determined with either the radiometric or fluorimetric assay 
(FIGURE 5). From Western blot analysis, we concluded that wt heparanase as well as the single 
chain constructs GS3 and hyaluro are efficiently expressed and processed, whereas constructs 
106 and GS4 are expressed but not processed. Expression levels of constructs 109 and GS6 

25 were extremely low and barely detectable by Western blot analysis. Only the' wt, GS3 and 

hyaluro constructs showed enzymatic activity. We conclude that single chain constructs 106 and 
GS4 are inactive whereas constructs 109 and GS6 are probably unstable. Since GS3 and hyaluro 
are active but are processed despite the changes that were introduced in the cleavage sites we can 
not draw any conclusion with respect to the intrinsic activity of the precursors. We therefore 

30 proceeded with the expression in cells that are devoid of the enzyme(s) responsible for 
heparanase processing. 
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EXAMPLE 5 
Expression of heparanase molecules in insect cells. 

Recombinant baculoviruses containing the heparanase constructs were generated 
using the Bac to Bac expression system (Invitrogen). Recombinant baculoviruses were used to 
5 infect Sf9 insect cells (50x106 cells per T-175 flask) grown in Grace's insect medium with 10% 
FBS. Cells were collected 48h after infection, and centrifiiged at 500g for 5 minutes. Cell 
lysates were prepared as above using a lysis buffer with SOOmM NaCl instead of 150mM used 
for COS7 (here we used 500mM NaCl because we observed an improvement in protein quantity 
in the soluble fraction). 

1 o The three heparanase constructs that showed enzymatic activity when produced in 

COS-7 cells were transferred into a baculovirus expression system. The proteins were expressed 
in Sf9 cells and purified by heparin affinity chromatography. Western blot analysis showed that, 
in contrast to what was observed in COS-7 cells, no processing of wt or mutant heparanases 
occurred in this expression system. Analysis of the enzymatic activity of the purified single 

15 chain proteins by the fluorimetric activity assay revealed that the unprocessed wt enzyme had a 
very low activity, whereas the unprocessed GS3 and hyaluro proteins resulted to be highly 
active, with specific activities comparable to those observed with the correctly processed wild 
type enzyme produced in COS-7 cells. 

GS3 and hyaluro where undistinguishable from the wild type recombinant 

20 enzyme extracted from COS-7 cells or from the authentic wt enzyme partially purified from 

HCT-1 16 cells on what concerns pH and ionic strength dependence of the enzymatic activity and 
were inhibited with similar potencies by heparin. 

The constructs having TEV cleavage sites at positions 109/1 10 and 
109/1 104-157/158 were expressed, purified on a heparin affinity column and digested overnight 

25 at room temperature with TEV protease (0.5 \M) in 50 mM Mes pH 6.0, 10% glycerol, 0.5 mM 
EDTA. Complete processing was observed in both cases, however only the double mutant, 
carrying TEV sequences at both cleavage junctions was activated by this treatment, indicating 
that processing at the E109/G1 10 junction only is not sufficient for eliciting activation of 
heparanase. 

30 
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EXAMPLE 6 

Purification of recombinant heparanase constructs by Heparin Sepharose affinity 
chromatography, 

Cell lysates from COS7 or Sf9 insect cells were passed through SOOjul Heparin 
5 Sepharose CL-6B (Amersham, Piscataway, NJ) by gravity. The column was washed with 2ml of 
lysis buffer, then with 2ml of 50mM Tris-HCl pH 7.5, 500mM NaCl, and heparanase was 
eluted with 2ml of 50mM Tris-HCl pH 7.5, 1 M NaCl and concentrated about 5 fold with a 
Biomax-30K centrifugal concentrator (Millipore, Bedford, MA). 10% glycerol was added and 
the protein was stored in aliquots at -80°C. Protein concentration was determined using the BIO- 
10 RAD Protein Assay. 

EXAMPLE 7 

Large scale expression and purification 

Sf21 (or Sf9) cells were adapted to growth in serum free medium (Sf-900 II SFM, 
15 Invitrogen). Cells were infected with recombinant baculoviruses encoding heparanase constructs 
at multiplicities of infection varying between 1-10. 3 1 of infected cells were grown in spinner 
flasks at 27°C under a constant flux of sterile air. 48-96 hours after the infection cells were 
collected and separated from the medium by centrifugation. Synthetic and wt heparanase were 
found in both the cell pellet aiid in the supernatant. To extract synthetic heparanase from the cell 
20 pellet, cells were disrupted as outlined above. Cell lysates or the crude medium supernatant 
were filtered on a 0.22|i filter and loaded on a 20 ml-HyperD Heparin column (Biosepra Inc., 
Marlboro, MA) equilibrated with 50 mM Tris-HCl pH 7.5, 150 mM NaCl. Synthetic or wt 
heparanase were eluted by applying a linear 0.15- 1M NaCl gradient in 50 mM Tris HC1 pH 7.5. 
Recombinant proteins eluted at NaCl concentrations >500 mM. The pooled, heparanase- 
25 containing Heparin-column fractions were dialyzed overnight against 50 mM HEPES pH 7.5 and 
loaded on a Source S column (Amersham) equilibrated in the same buffer. Heparanase 
constructs eluted with 400-600 mM NaCl. Proteins were purified to homogeneity by a further 
chromatographic step on a 15/30 Superdex 75 size exclusion column. The purified proteins were 
aliquoted, shock-frozen in liquid nitrogen and stored at -80 °C. 

30 
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EXAMPLE 8 

Western Blotting. 

Rabbit polyclonal antibodies were generated against a peptide contained within 
the 50 kDa subunit (EPNSFLKKADIFINGSQ (SEQ ID NO: 14), corresponding to amino acids 

5 225 to 241 and containing the additional sequence GGC at its C-terminus). Antisera were 
immunopurified using the immunogen peptide immobilized on a thiopropyl Sepharose resin 
(Amersham). lOul of proteins eluted from the heparin column were subjected to 10% SDS- 
polyacrylamide gel electrophoresis and transferred onto Protran BA 83 Cellulosenitrate 
membrane (Schleicher & Schuell Bioscience, Keene, NH). After saturation of non specific 

10 binding with 5% milk, the membrane was incubated with the polyclonal antibody described 

above diluted 1:500 in 5% milk, TBS and 0.05% Tween20 over night at 4°C. After washing, the 
membrane was incubated with anti-rabbit horseradish peroxidase-conjugated antibody diluted 
1:5000 for 30' at room temperature. The immunoreactive bands were detected by SuperSignal 
West Pico Chemiluminescent Substrate (Pierce Biotechnology, Rockford, IL). Finally the 

15 membrane was exposed to BIOMAX MR film (Kodak) for 1 0s. 

EXAMPLE 9 

Fluorometric labeling of heparan sulfate 

Heparan sulfate sodium salt from bovine kidney (Sigma-Aldrich Corp., St. Louis, 
20 MO) was labeled with fluorescein isothiocyanate (FITC) as previously described (Toyoshima 
and Nakajima, J. Biol. Chem. 274: 24153-24160 (1999)). 5 mg of heparan sulfate and 5 mg of 
FITC were dissolved in 1 ml of 0.1 M Na2C03 pH 9.5 and incubated over night at 4°C in the 
dark. The solution was then loaded on MicroSpin G-25 columns in order to separate FITC 
labeled Heparan Sulfate (FITC-HS) from unreacted FITC. The FITC-HS was subjected to a first 
25 gel-filtration chromatographic step through Sephacryl S-300 in 1 50mM NaCl, 25mM Tris-HCl 
pH=7.5 buffer to separate the high molecular weight heparan sulfate species. The colored 
fractions were pooled, concentrated with Biomax-IOK centrifugal concentrator (Millipore) and 
rechromatographed on Sephacryl S-300 (as above) in order to obtain heparan sulfate species 
with homogeneous molecular weight The eluted fractions were analyzed by HPLC Superdex 
30 75™ (Pharmacia Biotech) chromatography system. The fluorescence in each fraction was 
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measured by an L-7485 fluorescence detector (Merck Hitachi). We obtained four main fractions 
with different molecular weight heparan sulfate products. The quantity of FITC-HS in each 
fraction was measured with the Blyscan Glycosaminoglycan Assay (Biocolor Ltd., Belfast, 
Northern Ireland). 

5 

EXAMPLE 10 

Fluorimetric assay. 

This assay is based on the degradation of FITC-HS monitored by HPLC size 
exclusion chromatography. 8|jl1 of purified heparanase was incubated with 5^1 of FITC-HS in a 

10 SOjil of 50 mM MES pH 6, 10% glycerol (heparanase activity buffer, HAB). The reaction 

mixture was incubated at room temperature for a defined period and the reaction was stopped by 
the addition of 50|ig of heparin. The mixture was then filtered using Ultrafree-MC centrifugal 
filter Devices (Millipore). 20*il were injected on a Superdex 75™ (Pharmacia Biotech) column 
equilibrated in buffer 50mM Hepes pH 7.5 ISOmM Na 2 S0 4 and connected to a Merck-Hitachi 

1 5 HPLC system. Fluorescent heparan sulfate degradation products were detected by an L-7485 
fluorescence detector. Heparanase activity was assessed by monitoring the increase in lower 
molecular weight heparan sulfate species compared with the intact FITC-HS and quantified by 
peak area integration. 

20 EXAMPLE 11 

Radiometric labeling and biotinvlation at the reducing end of heparan sulfate 

lOmg of heparan sulfate sodium salt from bovine kidney (Sigma) were partially 
N-de-acetylated and re-acetylated with pH] acetic anhydride as previously described (Freeman 
and Parish, Biochem. J. 325: 229-237 (1997)). Tritiated heparan sulfate was then subjected to 

25 reductive amination at the reducing end as described. Tritiated, reductively aminated heparan 
sulfate was further conjugated to biotin using EZ-Link Sulfo-NHS-LC-Biotin (Pierce). This 
biotin analog has an N-hydroxysuccinimido ester moiety that can react with the amino group 
generated at the reducing end of the heparan sulfate molecules. We calculated a recovery in 
about 5mg of tritiated heparan sulfete, reductively aminated and resuspended in 1 ml of H2O (an 

30 estimated final concentration of 100 nM taking into account an average in heparan sulfate 
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molecular weight of 500KDa). To lOOjil of this solution Img of EZ-Link Sulfo-NHS-LC-Biotin 
(about 100-fold molar excess) and 20^1 of phosphate buffer pH 7.5 were added. The reaction 
mixture was incubated overnight at room temperature. The reaction mixture was then loaded on 
PD-10 desalting column in order to separate biotinylated, tritiated heparan sulfate from 
5 unreacted biotin. We finally obtained four fractions (1 ml each), which were tested for their 
ability to be immobilized on Reacti-Bind Streptavidin High Binding Capacity Coated Plates 
(Pierce). 

EXAMPLE 12 

10 Radiometric assay. 

This assay is based on the degradation of tritiated heparan sulfate immobilized on 
microplate. Each well of the Reacti-Bind Streptavidin High Binding Capacity Coated Plates was 
pre-treated according to manufacturer's instructions. Initially, different amounts of each fraction 
of tritiated, biotinylated heparan sulfate obtained after PD-10 desalting column were added to 

15 each well (in duplicate) in PBS to a final volume of lOOjil. After assesseing that the maximum 
binding is obtained with a volume of fraction 2 corresponding to 100x103 d.p.m. this amount 
was always used. The binding was carried out over night at room temperature. The wells were 
then washed three times with PBS and twice with HAB. 1 Opl of purified heparanase were added 
to each well in HAB to a final volume of lOOjil. The wells were incubated at room temperature 

20 for 2-24 hours. Finally, the liberated radioactivity due to tritiated heparan sulfate products 
generated by heparanase in each well was measured and normalized against a buffer blank. 

EXAMPLE 13 

Determination of specific activity of Heparanase constructs. 

25 Specific activities of the heparanase constructs either transiently expressed in 

COS7 cells or expressed in the baculovirus system were determined as follows: 

Specific activity = normalized activity (d.p.m./uD 

normalized densitometric volume (volume/pi) 

30 
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In detail, activity of partially purified heparanse constructs was determined in the 
radiometric assay by titrating each preparation in such a way that a linear dose-activity 
relationship was observed. These titrations were repeated three times with each preparation and 
a mean, normalized activity (d.p.nx/jii) was calculated. Protein expression was determined by 
5 the Western blotting experiments: the chemiluminescent readout was quantified by densitometry. 
Again, experiments were repeated three times and mean values were determined. The specific 
activity was obtained by dividing the normalized activity (d.p.m7fil) by the normalized 
densitometric volume (volume/pi). 
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WHAT IS CLAIMED IS: 

1 . A synthetic nucleic acid molecule comprising a sequence of nucleotides 
that encodes a mammalian heparanase protein, the sequence of nucleotides comprising two 

5 consensus cleavage sites recognized by an endoproteinase, the cleavage sites located between 
residues 100 and 168 of the heparanase protein. 

2. A vector comprising the nucleic acid molecule of claim 1 . 
10 3. A host cell comprising the vector of claim 3. 

4. The host cell of claim 3, wherein the host cell is an insect cell. 

5. The host cell of claim 3, wherein the host cell is a yeast cell. 

15 

6. The host cell of claim 5, wherein the yeast is selected from the group 
consisting of: Pichia pastoris, Hansenula pofymorpha and Saccharomyces cervisiae. 

7. The synthetic nucleic acid molecule of claim 1, wherein the heparanase 
20 protein is human heparanase. 

8. The synthetic nucleic acid molecule of claim 7, wherein the consensus 
cleavage sites are located before residues Gl 10 and K158 of the human heparanase protein. 

2 5 9. A vector comprising the synthetic nucleic acid molecule of claim 8. 

10. The vector of claim 9, wherein the vector is a baculovirus vector. 

11. A host cell comprising the vector of claim 9. 

30 

12. The host cell of claim 1 1 , wherein the host cell is an insect cell. 

13. The host cell of claim 1 1 , wherein the host cell is a yeast cell. 
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14. The synthetic nucleic acid molecule of claim 7, wherein the consensus 
cleavage sites are tobacco etch virus (TEV) protease cleavage sites. 

15. The synthetic nucleic acid molecule of claim 7, wherein the consensus 
cleavage sites are 3C protease cleavage sites from picomavirus. 

16. The synthetic nucleic acid molecule of claim 7, wherein the consensus 
cleavage sites are thrombin protease cleavage sites. 

17. The synthetic nucleic acid molecule of claim 7, wherein the consensus 
cleavage sites are enterokinase cleavage sites. 

1 8. The synthetic nucleic acid molecule of claim 7, wherein the consensus 
cleavage sites are Factor Xa cleavage sites. 

19. A constitutively active, single-chain mammalian heparanase nucleic acid 
molecule comprising a portion that encodes a mammalian heparanase protein, the protein coding 
portion consisting essentially of a sequence of nucleotides encoding anN-tenninal fragment of 
about 8 kDa, a linker, and a sequence of nucleotides encoding a C-terminal fragment of about 50 
kDa, wherein the N-terminal and C-terminal fragments encode protein fragments that are 
substantially similar to wild-type fragments. 

20. The gene of claim 19, wherein the protein coding portion encodes human 

heparanase. 

2 1 . The gene of claim 20, wherein the linker comprises a sequence of 
nucleotides that encodes a central loop region of the hyaluronidase protein. 

22. The gene of claim 2 1 , wherein the hyaluronidase is from H. baillensis. 

23. The gene of claim 20, wherein the linker comprises a sequence of 
nucleotides that encodes a (GlySer)3 linker. 

24. A vector comprising the gene of claim 20. 
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25 . A host cell comprising the vector of claim 24. 

26. The vector of claim 24 which is a baculovirus vector. 

5 i. 

27. The host cell of claim 25 which is an insect cell. 

28. The host cell of claim 25 which is a yeast cell. 

10 29. A purified synthetic heparanase protein encoded by the gene of claim 20. 

30. A method of expressing mammalian heparanase in non-mammalian cells 

comprising: 

(a) transforming or transfecting non-mammalian cells with a vector 
15 comprising a sequence of nucleotides that encodes a mammalian heparanase protein, the 

sequence of nucleotides comprising two consensus cleavage sites recognized by an 
endoproteinase, the cleavage sites located between residues 100 and 168 of the heparanase 
protein; 

(b) culturing the host cell under conditions which allow expression of said 
20 heparanase protein; and 

(c) exposing the cells to the endoproteinase, wherein the heparanase 
protein is cleaved at the consensus cleavage sites. 

31. A method as in claim 30, wherein the heparanase is human. 

25 

32. A method as in claim 31, wherein the consensus cleavage sites are located 
before residues Gl 10 and K158. 

33. A method as in claim 3 1 , wherein the vector is a baculovirus vector. 

30 

34. The method of claim 3 1 , wherein the cells are insect cells. 

35. The method of claim 3 1 , wherein the consensus cleavage sites are tobacco 
etch virus protease cleavage sites. 
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36. A method of expressing a single chain, constitutively active mammalian 
heparanase in non-mammalian cells comprising: 

(a) transforming or transfecting non-mammalian cells with a vector 

5 comprising a synthetic mammalian heparanase gene, wherein the synthetic gene comprises a 

portion that encodes the heparanase protein, the protein coding portion consisting essentially of a 
sequence of nucleotides encoding an N-terminal fragment of about 8 kDa, a sequence of 
nucleotides encoding a linker and a sequence of nucleotides encoding a C-terminal fragment of 
about 50 kDa, wherein the N-terminal and C-terminal fragments encode protein fragments that 

10 are substantially similar to wild-type fragments; and 

(b) culturing the host cell under conditions which allow expression of said 

heparanase protein 

37. The method of claim 36 wherein the linker comprises a central loop region 
15 of the hyaiuronidase protein. 

38. The method of claim 36 wherein the linker comprises a central (GlySer)3. 

39. A substantially pure protein produced by the method of claim 36. 



20 



40. The protein of claim 39 which is capable of binding an antibody that is 
specific for wild-type heparanase. 
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TITLE OF THE INVENTION 

SYNTHETIC HEPARANASE MOLECULES AND USES THEREOF 

ABSTRACT OF THE DISCLOSURE 

The present invention relates to synthetically produced, enzymatically active 
heparanase nucleic acid molecules that are capable of expression in high yield heterologous 
expression systems, and to polypeptides encoded by said molecules. Also provided herein are 
methods of expressing mammalian heparanase in heterologous expression systems, wherein high 
yields of biologically active heparanase are produced compared to prior art methods. 
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FIGURE 1. Biosynthesis of Heparanase. 
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